Relating Query Popularity and File Replication in the Gnutella Peer-to-Peer Network
Abstract
In this paper, we characterize user behavior in a peer-to-peer (P2P) file sharing network. Our characterization is based on the results of an extensive passive measurement study of the messages exchanged in the Gnutella P2P file sharing system. Using the data recorded during this measurement study, we analyze which queries a user issues and which files a user shares. The investigation of users' queries leads to a characterization of query popularity. Furthermore, the analysis of the files shared by the users leads to a characterization of file replication. As our major contribution, we relate query popularity and file replication by an analytical formula characterizing the matching of files to queries. The analytical formula defines a matching probability for each pair of query and file, which depends on the rank of the query with respect to query popularity, but is independent of the rank of the file with respect to file replication. We validate this model by conducting a detailed simulation study of a Gnutella-style overlay network and comparing simulation results to the results obtained from the measurement.
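The abstract does not state the analytical formula itself, so the following is only a minimal sketch of the property it describes: a matching probability that depends on the query's popularity rank and not on the file's replication rank. The power-law form, the function names, and the parameters `scale` and `exponent` are all assumptions made for illustration and are not taken from the paper.

```python
from typing import Sequence


def match_probability(query_rank: int, scale: float = 0.5, exponent: float = 1.0) -> float:
    """Hypothetical probability that a query of the given popularity rank matches a file.

    ASSUMPTION: a power-law decay in the query rank. The paper's actual formula
    is not given in the abstract. Note that no file rank appears as an argument.
    """
    if query_rank < 1:
        raise ValueError("query_rank starts at 1 (most popular query)")
    return min(1.0, scale / query_rank ** exponent)


def expected_matching_replicas(query_rank: int, replica_counts: Sequence[int]) -> float:
    """Expected number of file copies in the network that match the query.

    Because the matching probability is the same for every file, it factors out
    of the sum over files, leaving p(query_rank) times the total replica count.
    """
    p = match_probability(query_rank)
    return p * sum(replica_counts)


if __name__ == "__main__":
    # Hypothetical replica counts, ordered by file replication rank.
    replicas = [8, 4, 2, 1, 1]
    for rank in (1, 2, 4):
        print(rank, expected_matching_replicas(rank, replicas))
```

Under this assumption, reordering the files leaves the result unchanged, which is one way to read "independent of the rank of the file": only the query rank and the total number of replicas matter.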
Similar resources
Search Performance Analysis in Peer-to-Peer Networks
Recently, peer-to-peer (P2P) networks have gained great attention and popularity. One key challenge in P2P resource sharing environments is an efficient search algorithm. This is especially important for Gnutella-like decentralized and unstructured networks since they have power-law degree distributions. A robust search algorithm should respond to the query message promptly without gene...
Improving QoS on Gnutella
The use of peer-to-peer systems for sharing information, files and other resources has risen dramatically over the last three years. File sharing is the ‘killer app’ that has driven this explosion in popularity. The first generation of peer-to-peer file sharing systems, including Napster, Morpheus and Kazaa, followed the traditional client-server paradigm. However, concerns over the legality an...
Content Location in Peer-to-peer Systems: Exploiting Locality
Efficient content location is a fundamental problem for decentralized peer-to-peer systems. Gnutella, a popular file-sharing application, relies on flooding queries to all peers. Although flooding is simple and robust, it is not scalable. In this chapter, we explore how to retain the simplicity of Gnutella while addressing its inherent weakness: scalability. We propose two complementary content...
Implications of the file names and user requested queries on Gnutella performance
The Gnutella file sharing system allows a large number of peers to share their local files. However, it does not coordinate the way by which these shared objects are named or how they are searched by other users; such decisions are made independently by each peer. In this work, we investigate the practical performance implications of this design. We collected the shared filenames and user gener...
Cost-effective broadcast for fully decentralized peer-to-peer networks
Recently, there has been a growing interest in peer-to-peer networks, sparked by the popularity of file sharing applications such as Napster and Gnutella. A typical characteristic of a peer-to-peer system is that all the nodes are equal participants in the network. Gnutella is an example of a ‘pure’ peer-to-peer system, being fully distributed where all nodes are equal and no special nodes with...